Modeling Heterogeneous Networks for Information Ranking, Enrichment and Resolution on Microblogs
Abstract
Microblogging, a new type of online information-sharing platform built on short messages of up to 140 characters, has grown quickly and received increasing attention in recent years. A microblogging platform (e.g., Twitter) enables both individuals and organizations to disseminate information, from current affairs to breaking news, in a timely fashion, which makes it a valuable source of very fresh knowledge. For example, during Hurricane Irene in 2011, updates from users living in New York City and transportation and evacuation posts from the government were very useful for people keeping track of the disaster. Natural Language Processing (NLP) research on this new genre is therefore needed to assist knowledge mining and discovery. Unlike semi-structured knowledge bases (e.g., Wikipedia) and traditional news, microblogs tend to be noisy, short, and informal, and information implicitness is more prominent and pervasive in them. These characteristics pose unique challenges both to people reading and understanding microblogs and to many knowledge mining and discovery tasks. To alleviate these problems, in this thesis we propose to filter noisy and uninformative information, enrich short microblogs with background knowledge from knowledge bases such as Wikipedia, and resolve informal and implicit information to its regular referents. To achieve these goals, we propose to leverage and model heterogeneous information networks (HINs), in contrast to most existing NLP approaches on traditional genres (e.g., news), which explore only a single type of information (e.g., text). Microblogging contains heterogeneous types of information, from social network structures to cross-genre linkages, forming rich HINs. By designing effective approaches to model both unstructured texts and structured HINs, we can incorporate additional evidence from HIN structures beyond texts. In this thesis, we present different approaches to construct HINs from cross-genre, cross-source, and cross-type information by incorporating the existing clean social relations, as well as performing deep content analysis with some of the well-developed...
Related resources
A Latent Representation Model for Sentiment Analysis in Heterogeneous Social Networks
The growing availability of social media platforms, in particular microblogs such as Twitter, has opened new ways for people to express their opinions. Sentiment Analysis aims at inferring the polarity of these opinions, but most existing approaches are based only on text, disregarding information that comes from the relationships among users and posts. In this paper we consider microblogs...
A novel key management scheme for heterogeneous sensor networks based on the position of nodes
Wireless sensor networks (WSNs) have many commercial, military, and environmental applications. Because they deploy low-cost sensor nodes with restricted energy resources, these networks face many security challenges. A basic approach to securing wireless communication in WSNs is to propose an efficient cryptographic key management protocol be...
Distributed Incremental Least Mean-Square for Parameter Estimation using Heterogeneous Adaptive Networks in Unreliable Measurements
Adaptive networks include a set of nodes with adaptation and learning abilities for modeling various types of self-organized and complex activities encountered in the real world. This paper presents the effect of a heterogeneously distributed incremental LMS algorithm with ideal links on the quality of unknown parameter estimation. In heterogeneous adaptive networks, a fraction of the nodes, defi...
An efficient non-repudiation billing protocol in heterogeneous 3G-WLAN networks
Wireless communication, which delivers a variety of services to users, has grown rapidly in recent years. Third-generation cellular networks (3G) and wireless local area networks (WLAN) are two widely used wireless technologies. 3G networks can cover a vast area, while WLAN networks provide higher transmission rates with less coverage. Since the two n...
Network Selection in Heterogeneous Wireless Environment Using Decision Making Algorithms-TOPSIS and PROMETHEE
The forthcoming wireless environment is a fusion of numerous networks with diverse technologies deployed by individual operators. In such an environment, innovative network selection methodologies are required not only to provide “always best connected” service to mobile users but also to maximize network operators’ revenue. To fulfill such requirements, multiple attributes from each network are to...